Search CORE

119 research outputs found

Patterns versus Characters in Subword-aware Neural Language Modeling

Author: MP Marcus
N Chomsky
PJ Werbos
S Hochreiter
Publication venue
Publication date: 02/09/2017
Field of study

Words in some natural languages can have a composite structure. Elements of this structure include the root (that could also be composite), prefixes and suffixes with which various nuances and relations to other words can be expressed. Thus, in order to build a proper word representation one must take into account its internal structure. From a corpus of texts we extract a set of frequent subwords and from the latter set we select patterns, i.e. subwords which encapsulate information on character

n

-gram regularities. The selection is made using the pattern-based Conditional Random Field model with

l_1

regularization. Further, for every word we construct a new sequence over an alphabet of patterns. The new alphabet's symbols confine a local statistical context stronger than the characters, therefore they allow better representations in

{\mathbb{R}}^n

and are better building blocks for word representation. In the task of subword-aware language modeling, pattern-based models outperform character-based analogues by 2-20 perplexity points. Also, a recurrent neural network in which a word is represented as a sum of embeddings of its patterns is on par with a competitive and significantly more sophisticated character-based convolutional architecture.Comment: 10 page

arXiv.org e-Print Archive

Crossref

Online optimal and adaptive integral tracking control for varying discrete‐time systems using reinforcement learning

Author: Bertsekas DP
Levine WS
Lewis FL
Sutton RS
Werbos PJ
Åström KJ
Publication venue: 'Wiley'
Publication date: 16/04/2020
Field of study

Conventional closed‐form solution to the optimal control problem using optimal control theory is only available under the assumption that there are known system dynamics/models described as differential equations. Without such models, reinforcement learning (RL) as a candidate technique has been successfully applied to iteratively solve the optimal control problem for unknown or varying systems. For the optimal tracking control problem, existing RL techniques in the literature assume either the use of a predetermined feedforward input for the tracking control, restrictive assumptions on the reference model dynamics, or discounted tracking costs. Furthermore, by using discounted tracking costs, zero steady‐state error cannot be guaranteed by the existing RL methods. This article therefore presents an optimal online RL tracking control framework for discrete‐time (DT) systems, which does not impose any restrictive assumptions of the existing methods and equally guarantees zero steady‐state tracking error. This is achieved by augmenting the original system dynamics with the integral of the error between the reference inputs and the tracked outputs for use in the online RL framework. It is further shown that the resulting value function for the DT linear quadratic tracker using the augmented formulation with integral control is also quadratic. This enables the development of Bellman equations, which use only the system measurements to solve the corresponding DT algebraic Riccati equation and obtain the optimal tracking control inputs online. Two RL strategies are thereafter proposed based on both the value function approximation and the Q‐learning along with bounds on excitation for the convergence of the parameter estimates. Simulation case studies show the effectiveness of the proposed approach

Crossref

White Rose Research Online

Deep Markov Random Field for Image Modeling

Author: A Dempster
Alex Graves
C Dong
E Ising
GE Hinton
GR Cross
H Rue
J Duchi
J Pearl
J Portilla
J Shotton
J Wright
James Hays
Jean-François Lalonde
M Schuster
PJ Werbos
R Timofte
S Geman
SC Zhu
SZ Li
WT Freeman
Publication venue
Publication date: 07/09/2016
Field of study

Markov Random Fields (MRFs), a formulation widely used in generative image modeling, have long been plagued by the lack of expressive power. This issue is primarily due to the fact that conventional MRFs formulations tend to use simplistic factors to capture local patterns. In this paper, we move beyond such limitations, and propose a novel MRF model that uses fully-connected neurons to express the complex interactions among pixels. Through theoretical analysis, we reveal an inherent connection between this model and recurrent neural networks, and thereon derive an approximated feed-forward network that couples multiple RNNs along opposite directions. This formulation combines the expressive power of deep neural networks and the cyclic dependency structure of MRF in a unified model, bringing the modeling capability to a new level. The feed-forward approximation also allows it to be efficiently learned from data. Experimental results on a variety of low-level vision tasks show notable improvement over state-of-the-arts.Comment: Accepted at ECCV 201

arXiv.org e-Print Archive

Crossref

Deep Burst Denoising

Author: A Foi
C Dong
C Dong
Chih-Yuan Yang
CRA Chaitanya
E Shelhamer
F Heide
F Heide
H Zhao
J Yang
JJ Hopfield
K Dabov
K Nasrollahi
K Zhang
M Everingham
M Gharbi
M Maggioni
O Ronneberger
PJ Werbos
S Farsiu
SW Hasinoff
Y Chen
Z Liu
Publication venue
Publication date: 15/12/2017
Field of study

Noise is an inherent issue of low-light image capture, one which is exacerbated on mobile devices due to their narrow apertures and small sensors. One strategy for mitigating noise in a low-light situation is to increase the shutter time of the camera, thus allowing each photosite to integrate more light and decrease noise variance. However, there are two downsides of long exposures: (a) bright regions can exceed the sensor range, and (b) camera and scene motion will result in blurred images. Another way of gathering more light is to capture multiple short (thus noisy) frames in a "burst" and intelligently integrate the content, thus avoiding the above downsides. In this paper, we use the burst-capture strategy and implement the intelligent integration via a recurrent fully convolutional deep neural net (CNN). We build our novel, multiframe architecture to be a simple addition to any single frame denoising model, and design to handle an arbitrary number of noisy input frames. We show that it achieves state of the art denoising results on our burst dataset, improving on the best published multi-frame techniques, such as VBM4D and FlexISP. Finally, we explore other applications of image enhancement by integrating content from multiple frames and demonstrate that our DNN architecture generalizes well to image super-resolution

arXiv.org e-Print Archive

Crossref

CSIT: channel state and idle time predictor using a neural network for cognitive LTE-Advanced network

Author: A Galindo-Serrano
A He
Adnan Shahid
C Clancy
F Rosenblatt
G Yuan
H Shokri-Ghadikolaei
Hyung Seok Kim
IF Akyildiz
J Mitola
JL Elman
K Tsagkaris
KI Pedersen
Kyung-Geun Lee
L Giupponi
M Bkassiny
MI Taj
O Vucenta
P Hartono
PJ Werbos
S Haykin
S Haykin
Saleem Aslam
VK Tumuluru
Y Guangxiang
Y Liang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

End-to-End Memory Networks: A Survey

Author: A Jafarian
BA Shawar
CV Negoita
J Hill
LA Zadeh
LA Zadeh
PJ Werbos
R Jafari
R Jafari
R Jafari
S Mohanty
S Razvarz
S Razvarz
S Razvarz
S Razvarz
S Young
T Araujo
Y Dote
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/07/2020
Field of study

Constructing a dialog system which can speak naturally with a human is considered as a major challenge of artificial intelligence. End-to-end dialog system is taken to be a primary research topic in the area of conversational systems. Since an end-to-end dialog system is structured based on learning a dialog policy from transactional dialogs in a defined extent, therefore, useful datasets are required for evaluating the learning procedures. In this paper, different deep learning techniques are applied to the Dialog bAbI datasets. On this dataset, the performance of the proposed techniques is analyzed. The performance results demonstrate that all the proposed techniques attain decent precisions on the Dialog bAbI datasets. The best performance is obtained utilizing end-to-end memory network with a unified weight tying scheme (UN2N)

Crossref

White Rose Research Online

The complex X-ray spectrum of NGC 4507

Author: A Abuelgasim
A Moody
AC Bernard
C Shang
CH Chao
Dibe
DL Civco
EA Wan
F Kanaya
F. Fierens
G Loung
GA Carpenter
GG Wilkinson
GG Wilkinson
GG Wilkinson
GM Foody
GM Foody
GM Foody
GM Foody
HL Maas
I Kanellopoulos
I Kanellopoulos
IV Kanellopoulos
J Ellis
J Lee
J Maslanik
J. P. Fitch
JA Benediktsson
JA Benediktsson
JD Paola
JM Key
L Jimenez
LE Pierce
M Zhang
ML Minsky
MM Fischer
MS Spina
O Hellwich
P Dreyer
P Werbos
PJ Huber
PK Simpson
R Kwok
S Furby
SS Gopal
SY Kung
T Kohonen
TW Ryan
UR Ramacher
W Wan
ZK Liu
Publication venue
Publication date: 01/01/1997
Field of study

XMM-Newton and Chandra/HETG spectra of the Compton-thin (NH 4x10^{23} cm^{-2}) Seyfert 2 galaxy, NGC 4507, are analyzed and discussed. The main results are: a) the soft X-ray emission is rich in emission lines; an (at least) two--zone photoionization region is required to explain the large range of ionization states. b) The 6.4 keV iron line is likely emitted from Compton-thick matter, implying the presence of two circumnuclear cold regions, one Compton-thick (the emitter), one Compton-thin (the cold absorber). c) Evidence of an Fe xxv absorption line is found in the Chandra/HETG spectrum. The column density of the ionized absorber is estimated to be a few x10^{22} cm^{-2}.Comment: accepted for publication in A&

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Archivio della Ricerca - Università di Roma 3

Kingston University Research Repository

A New Method for Species Identification via Protein-Coding and Non-Coding DNA Barcodes by Combining Machine Learning with Bioinformatic Methods

Author: AB Zhang
AB Zhang
AB Zhang
AB Zhang
AC Tamhane
Ai-bing Zhang
AS Babu
AVZ Brower
B Liao
BC O'Meara
C Moritz
C Wu
C Wu
C Wu
CH Wu
CP Meyer
D Steinke
David S. Milstone
DB Parker
DE Rumelhart
DE Schindel
DG Higgins
DL Swofford
DM Spooner
DS Gladstein
E Marshall
El Clare
F Austerlitz
F Rosenblatt
G Giribet
H Song
HA Ross
HC Wang
HI Jeffrey
I Meusnier
IT Jolliffe
J Dopazo
J Park
Jie Feng
Jun Wu
K Munch
K Munch
KH Chu
KL Silva-Brando
KW Will
L Breiman
L Breiman
L Frzal
L Prendini
M Elias
M Hajibabaei
M Hajibabaei
M Kimura
M Lou
M Randic
M Smith
M Wiemer
MA Larkin
MC Ebach
MJ Hickerson
MT Monaghan
N Saitou
P Kuksa
PDN Hebert
PDN Hebert
PDN Hebert
PDN Hebert
PDN Hebert
Ping Wan
PJ Werbos
Qiang Gao
R DeSalle
R Meier
R Meier
R Nielsen
RC Edgar
RD Ward
RD Ward
Robert D. Ward
S Guindon
Seo
T Ekrem
T Lefebure
TL Whitworth
TR Gregory
V Savolainen
WC Wheeler
Wei-zhong Zhao
WP Wang
Z Abdo
ZJ Zhang
Publication venue: Public Library of Science
Publication date: 20/02/2012
Field of study

Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75–100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62–98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60–99.37%) for 1094 brown algae queries, both using ITS barcodes

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins

Author: A Bakan
A Chakrabartty
A Guerler
A Guerler
A Jeltsch
A Kuzmanic
A Pintar
AC Wallace
AE Todd
AR Panchenko
AR van Erkel
AS Aranko
B Anand
B Halle
B Lee
BA Cunningham
BE Jones
C Pommie
C Vogel
CC Chang
CH Lu
CH Shih
CJ Crasto
CP Lin
CP Ponting
D Bordo
DA Case
Darren R. Flower
DL Nelson
DM Carrington
EA Ribeiro Jr
ESC Shih
FH Arnold
G Amitai
G Bulaj
G Pollastri
GS Baird
H Iwai
H Zhang
HK Liang
HM Berman
I Bahar
I Remy
J Chen
J Hennecke
J Weiner III
J Zhu
JD Pedelacq
Jenn-Kang Hwang
JM Bujnicki
JM Word
JM Yang
JR Quinlan
K Nishikawa
KH Paszkiewicz
L Chen
L Li
LC Tsai
LG Gebhard
Li-Fen Wang
M Elarabaty
M Iwakura
M Kojima
M Ostermeier
M Paluszewski
M Zavodszky
ML Connolly
MN Nguyen
PC Lyu
Ping-Chiang Lyu
PJ Werbos
R Garrett
R Vandrunen
RJ Moreau
S Akanuma
S Hovmoller
S Kundu
S Topell
S Uliel
S Uliel
SF Betz
SG Peisajovich
SJ Hubbard
ST Hsu
T Haliloglu
T Hesterberg
T Nakamura
T Noguchi
Tian Dai
TU Schwartz
V Anantharaman
V Muralidharan
W Kabsch
W Li
W Zheng
WC Lo
WC Lo
WC Lo
Wei-Cheng Lo
WR Pearson
Y Lindqvist
Y Yu
Y Zhang
Yen-Yi Liu
Z Qian
Publication venue: Public Library of Science
Publication date: 16/02/2012
Field of study

Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure and function, and to create bifunctional fusion proteins unachievable by tandem fusion. CP is a complicated and expensive technique. An intrinsic difficulty in its application lies in the fact that not every position in a protein is amenable for creating a viable permutant. To examine the preferences of CP and develop CP viability prediction methods, we carried out comprehensive analyses of the sequence, structural, and dynamical properties of known CP sites using a variety of statistics and simulation methods, such as the bootstrap aggregating, permutation test and molecular dynamics simulations. CP particularly favors Gly, Pro, Asp and Asn. Positions preferred by CP lie within coils, loops, turns, and at residues that are exposed to solvent, weakly hydrogen-bonded, environmentally unpacked, or flexible. Disfavored positions include Cys, bulky hydrophobic residues, and residues located within helices or near the protein's core. These results fostered the development of an effective viable CP site prediction system, which combined four machine learning methods, e.g., artificial neural networks, the support vector machine, a random forest, and a hierarchical feature integration procedure developed in this work. As assessed by using the hydrofolate reductase dataset as the independent evaluation dataset, this prediction system achieved an AUC of 0.9. Large-scale predictions have been performed for nine thousand representative protein structures; several new potential applications of CP were thus identified. Many unreported preferences of CP are revealed in this study. The developed system is the best CP viability prediction method currently available. This work will facilitate the application of CP in research and biotechnology

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

A characterization for the constitutive relationships of 42CrMo high strength steel by Artificial Neural Network and its application in isothermal deformation

Author: Chen MS
Dong-sen Wu
Gui-chang Luo
Guo-zheng Quan
Haghdadi N
Haghdadi N
He YB
Hecht-Nielsen R
Holthausen K
Ji GL
Ji GL
Jian-ting Liang
Jie Zhou
Li HY
Li HY
Lin YC
Lin YC
Lin YC
Lu ZL
Mandal S
Marandi A
Phaniraj MP
Quan GZ
Quan GZ
Quan GZ
Ramanujam J
Reddy NS
Sabokpa O
Serajzadeh S
Sheikh H
Voyiadjis GZ
Wen-quan Lv
Werbos PJ
Wu W
Ying-ying Liu
Zhang HJ
Zhu YC
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Crossref